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ABSTRACT 

Using accurate positions from very deep radio observations to guide multi-object Keck spec- 
troscopy, we have determined a substantially complete redshift distribution for very luminous, distant 
submillimeter-selected galaxies (SMGs). A sample of 73 redshifts for SMGs in seven fields contains 
a surprisingly large number of 'associations': systems of SMGs with Mpc-scale separations, and red- 
shifts within 1200 km s -1 . This sample provides tentative evidence of strong clustering of SMGs at 
z ~ 2 — 3 with a correlation length ~ (6.9 ± 2.1)h~ 1 Mpc, using a simple pair-counting approach that 
is appropriate to the small, sparse SMG samples. This is somewhat greater than the well-determined 
correlation lengths for both z ~ 3 optical-UV color-selected Lyman break galaxies (LBGs) and z ~ 2 
QSOs. This could indicate that SMGs trace the densest large-scale structures in the high-redshift 
universe, and that they may either be evolutionarily distinct from LBGs and QSOs, or subject to a 
more complex astrophysical bias. 

Subject headings: galaxies: clusters: general - galaxies: evolution - galaxies: formation - galaxies: 
starburst - large-scale structure of universe - cosmology: observations 



1. INTRODUCTION 

Since 1997 several hundred galaxies have been discov- 
ered using imaging arrays at millimeter and submillimc- 
ter wavelengths (Blain et al. 2002; Smail et al. 2002; 
Scott et al. 2002; Borys et al. 2003; Webb et al. 2003; 
Greve et al. 2004a). The difficulties of identifying these 
galaxies at other wavelengths, and moreover determining 
their redshifts, are significant, owing both to the coarse 
positional accuracy (~ 8" ) of the discovery images and 
to the faint optical magnitudes of their proposed coun- 
terparts. By exploiting the high-redshift analog of the 
far-infrared-radio correlation between hot dust and ra- 
dio synchrotron emission, which are powered by young, 
hot stars and their associated supernovae respectively, 
sub-arcsecond radio positions can be found for a substan- 
tial fraction of submillimeter-selected galaxies (SMGs). 
This enables efficient multi-object optical spectroscopy 
to search for redshifts, given that the surface density of 
radio-detected SMGs on the sky is ~ 300 deg -2 (Chap- 
man et al. 2003a), and thus is well matched to the field of 
view and multiplex gain of current spectrographs. This 
technique was first tried using the Keck LRIS-B spec- 
trograph, which countcrintuitively detected identifiable 
restframe ultraviolet spectral lines and features in many 
of these galaxies. The redshift distribution of a large 
sample of 73 of these galaxies in seven independent fields 
listed in Table 1 is presented by Chapman et al. (2004a). 
The spectroscopic completeness for redshift determina- 
tions is ~ 50% for SMGs selected purely on the grounds 
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of submillimcter flux density greater than 5 mJy at a 
wavelength of 850 /xm, and ~ 70% of for SMGs with the 
same submillimeter flux density and radio detections at 
1.4 GHz brighter than 30/xJy. 

The remaining ~ 30% of radio-detected SMGs with- 
out redshifts are likely split between those at higher red- 
shifts and any with cooler dust temperatures and lower 
luminosities, neither of which yield radio detections, and 
galaxies in the redshift range 1.2 < z < 1.8, whose opti- 
cal continua are generally too faint for the detection of 
rest-frame UV absorption lines, and for which any rest- 
frame UV emission lines are not redshiftcd into the wide 
spectral range of LRIS (Chapman et al. 2004a). 

This reasonably complete redshift distribution con- 
firms absolutely that the SMGs are an important high- 
redshift galaxy population, and that galaxies within a 
factor of 5 in submillimeter flux density of the 5-mJy 
850-^im detection limit of this sample dominate the lu- 
minosity density from galaxies at redshifts z ~ 2. The 
redshift distribution can be represented adequately by a 
Gaussian with z — 2.4 and a = 0.65, with almost all 
SMGs found over the redshift range 1.5 < z < 3. SMGs 
are found in relatively small fields on the sky, less than 
10' across, but are spread very widely in redshift from at 
least 0.8 to 3.4. This means that the SMG survey vol- 
umes have a uniquely long thin pencil geometry, which 
extends in depth by 2-3 Gpc, but across the sky only as 
far as 5 Mpc. 

The spatial distribution of SMGs is important, as the 
detection of any clustering signal can provide information 
about the distribution of galaxies as a function of dark 
matter density. In principle, in a dark matter-dominated 
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universe the strength of clustering should correlate with 
the mass of halos (Bardeen et al. 1986), and thus its 
measurement provides an insight into the masses of the 
dark halos that host luminous galaxies (e.g. Overzier et 
al. 2003 and references therein). 

The clustering strength of many types of high- and low- 
rcdshift galaxies has been probed by a wide variety of 
methods. At rcdshifts less than those of SMGs, z < 1.3, 
the large-scale structure of galaxies is being mapped in 
great detail by the DEEP2 survey (Coil et al. 2003). At 
higher mean redshifts, the clustering of many hundreds 
of restframe UV/optically-selected, spectroscopically- 
confirmed Lyman-break galaxies (LBGs) can be mea- 
sured (Adclbcrgcr et al. 1998). Recent values of correla- 
tion lengths 2 are r = 4/i _1 Mpc (Porciani & Giavalisco 
2002) and r = 2.7/i" 1 Mpc (Ouchi et al. 2001). The 
published samples of LBGs lie at higher redshifts than 
most SMGs, which could on its own explain the lack of 
observed correlation between the positions of LBGs and 
SMGs (Peacock et al. 2000; Chapman et al. 2000; Webb 
et al. 2003) , even if they are drawn from a common un- 
derlying population of galaxies described by the same 
form of evolution. Lowcr-redshift counterparts to LBGs 
can be isolated by varying the color selection conditions 
(see Steidel et al. 2004). Spcctroscopically-confirmed 
samples of these galaxies can be compiled in parallel to 
surveys for SMG redshifts using the same multi-object 
slitmasks, and should ultimately provide an excellent 
opportunity to determine the relative three-dimensional 
distribution of LBGs and SMGs. The clustering of much 
rarer QSOs and relatively faint (~ 10 mJy) radio galaxies 
at redshifts that overlap with the SMGs has been probed 
by Croom et al. (2002) and Overzier et al. (2003), who 
report correlation lengths r — 5 and 6/i _1 Mpc respec- 
tively. However, these surveys are more sparsely sam- 
pled, and cover larger areas of sky (of order thousands of 
square degrees), so the typical source separation is much 
greater than the correlation length. 

Without redshifts to trace three-dimensional structure, 
only limits to the projected clustering strength of SMGs 
in two dimensions have been obtained (Carilli et al. 2000; 
Scott et al. 2002; Borys et al. 2003; Webb et al. 2003; 
see Fig. 1). This is unsurprising given the narrow, deep 
geometry of SMG surveys: multiple structures along the 
line of sight overlap and dilute the clustering signal. Note 
that even the best possible photometric rcdshifts derived 
from deep multicolor optical and ncar-IR imaging with 
Az ~ 0.1 are not sufficiently accurate to provide use- 
ful three-dimensional clustering information in the deep, 
narrow pencil-beam geometries of SMG surveys, which 
include very significant projection effects (see Fig. 2). 
The availability of a large number of spectroscopic red- 
shifts means that we can now remove these projection 
effects and investigate the clustering of the SMG popu- 
lation directly. 

Throughout this paper we assume a post- Wilkinson 
Microwave Anisotropy Probe cosmology with fl = 0.27, 
A = 0.73, and H = lOO/ikms" 1 Mpc" 1 with h = 0.71. 

2 The two-point correlation function of galaxies that describes 
the excess probability of finding a pair of galaxies separated by 
r over an unclustered distribution is observed to be a decreasing 
power- law with distance §(r) = (r/r )- -1 ' 8 (Coil et al. 2003). 
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Fig. 1. — ACF observed for 47 SMGs in the extended HDF field 
shown in Fig. 1 (Borys et al. 2003; Chapman et al. 2003a, 2004a). 
Other results for two-dimensional clustering of SMGs have been 
discussed by Carilli et al. (2000), Scott et al. (2002), Borys et al. 
(2003) and Webb et al. (2003). The errors on existing ACF data 
are much too great to measure the clustering signal without much 
larger SMG samples: rcdshifts arc essential to discern the cluster- 
ing of SMGs. The observed correlation functions of z ~ 1 EROs 
(Daddi ct al. 2001; r ~ 12ft- 1 Mpc), z ~ 3 LBGs (Adclberger et 
al. 1998; Porciani & Giavalisco 2002; r ~ 4/1" 1 Mpc) and z ~ 2.5 
SMGs assuming the correlation length rrj — 7h~ 1 Mpc determined 
in Section 3 (Tablet) are also shown. After projection, the SMGs 
and LBGs have similar ACFs, which arc much weaker than the 
ACF for the lower-redshift EROs. 

2. SIGNS OF CLUSTERING IN SMG SURVEYS 

There have been serendipitous detections of SMGs 
clustered in close proximity to other classes of high- 
rcdshift galaxies. For example there are detections of 
optically-selected galaxies at redshifts matching those of 
known SMGs and with separations of only ~ 100 kpc in 
the backgrounds of two gravitational lensing clusters of 
galaxies: at z = 2.80 in A370 (Ivison et al. 1998; Santos 
et al. 2004), and at z = 2.55 in A2218 (Kneib et al. 2004). 
SMGs were identified by Chapman et al. (2001) in the 
most overdense structure of LBGs at z ~ 3.1 (Steidel 
et al. 2000). Ovcrdcnsities of bright SMGs found near 
the targeted positions of some of the most extreme high- 
rcdshift radio galaxies at redshifts as great as z = 3.8 
(Ivison et al. 2000; Stevens et al. 2003) also provide evi- 
dence for some strong overdensities in the spatial distri- 
bution of SMGs, some confirmed by spectroscopy (Smail 
et al. 2003a, b). Can we find direct spectroscopic evi- 
dence of clustering in the general population of SMGs? 

In order to demonstrate the crucial importance of ob- 
taining rcdshifts for the SMGs before attempting to mea- 
sure their clustering properties, in Fig. 1 we determine 
the projected two-dimensional angular correlation func- 
tion (ACF) of 47 SMGs in the biggest available survey 
field (Borys et al. 2003; Chapman et al. 2004a). To 
measure a significant ACF for the SMGs the large frac- 
tional errors on the results shown in Fig. 1 indicate that 
a much larger sample of several hundred SMGs would be 
required, a challenge for the next generation of sensitive 
wide-field submillimetcr telescopes. 

We indeed find spectroscopic evidence for clustering of 
SMGs. In six of the seven fields containing SMGs with 
redshifts, "associations" of galaxies are found, with red- 
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shifts separated by less than 1200 km s -1 in radial veloc- 
ity (Table 1). This velocity is an approximate upper limit 
to the velocity dispersion of the richest clusters of galax- 
ies at the present epoch, and so should be larger than the 
velocity dispersion of any less evolved large-scale struc- 
tures at high redshifts. Furthermore, it is greater than 
the random offset velocity expected from emission gen- 
crated in randomly oriented outflowing galactic winds at 
high-redshifts (Erb et al. 2003). Most of the associations 
are pairs, but there is a group of three SMGs at z = 3.1 
in the highest overdensity in the survey of LBGs in the 
SA22 field (Steidel et al. 2000), and a group of five SMGs 
at z = 2 in the Hubble Deep Field (HDF; Fig. 2). SMGs 
are often found to have disturbed and probably interact- 
ing morphologies (e.g. Ivison et al. 1998; Chapman et 
al. 2003b). Note that the associations are not close inter- 
acting pairs of SMGs (such as, e.g., SMM J09431+4700; 
Neri et al. 2003; see also Swinbank et al. 2004), but are 
separated on much larger scales up to several Mpc. Their 
spatial separation is comparable both to the extent of the 
survey fields (Fig. 2) and to the scale of the local galaxy 
correlation length. It is possible that the successors of 
the SMGs may finally reside in the densest clusters, but 
at the observed epoch the SMGs with associated red- 
shifts are much more widely spread, presumably lying 
throughout infalling, pre-virialized filaments and walls 
in the large-scale structures of galaxies. 

In parallel to measuring the SMG redshifts, Chapman 
et al. (2004b) have found redshifts for a comparable num- 
ber of optically-faint radio galaxies (OFRGs) that are 
not detected at submillimeter wavelengths. Their optical 
spectra are similar to the SMGs, and it is very plausi- 
ble that these submillimeter-faint OFRGs are compara- 
bly luminous to the SMGs, but have dust temperatures 
too high to be detected at submillimeter wavelengths 
(Blain et al. 2004). The submillimeter-faint OFRGs 
display a similar abundance of systems with associated 
redshifts to the SMGs, both within the submillimeter- 
faint OFRG population itself (in the 03 hr, Lockman, 
SA13 and SA22 fields), and when cross-correlated with 
the SMG population (in the five-member HDF associa- 
tion and in the very overdense z = 3.1 structure in the 
SA22 field). Once Spitzer Space Telescope observations 
reveal whether these galaxies are truly as luminous as 
the SMGs, they offer to approximately double the den- 
sity of the sampling of large-scale structure provided by 
very luminous dusty galaxies. 

3. THE CORRELATION LENGTH OF SMG CLUSTERING 

To interpret our sparse data, we cannot use regular 
methods to determine clustering strength (Adelberger et 
al. 1998; Wilson et al. 2001) but must make use of the 
limited information as best we can. We compare the 
number of pairs of galaxies found in each field with ve- 
locity separations less than 1200 km s -1 (Table 1) with 
the number of pairs expected assuming a comoving cor- 
relation length that is fixed throughout the redshift range 
of the SMG sample. We further assume that the redshift 
distribution of the population is known accurately, and 
determine the space density of SMGs in each field that is 
required to match the observed number. This allows us 
to estimate the number of pairs expected in each field in 
the absence of clustering, as listed in Table 1 (A^' a jr ). To 
evaluate the number of pairs expected with a particular 



correlation function £, which is assumed to be constant 
in redshift, we carry out a double integral over volume 
for the product of the space density of SMGs N at two 
positions separated by a three-dimensional distance r, 
and the two-point correlation function £ : 



1 
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N(z,e,cf>)x (1) 



N(z', 6', cj)') x [1 + £(r)] dVdV. (2) 



The factor of a half corrects for double counting. One ra- 
dial variable is limited by the redshift range of the SMGs 
(values of z mm = 1.8 and z max = 4.0 were used, but note 
that the effects of changing z m - m to zero are less than 1%: 
the peaked distribution of N sets the size of iV pa ; r ), and 
the other the distance corresponding to relative velocity 
in the Hubble flow, with a limit Av set to the limiting 
velocity that defines a pair. This is usually 1200 km s -1 . 
Note that we integrate over the velocity range as we can- 
not be sure of the relative spatial distribution of galax- 
ies along the line of sight because of peculiar velocities, 
and so all galaxies that lie within Av of each other are 
counted. The solid angle of the survey in each rectan- 
gular field sets the limits on 9 and (f>. N is assumed the 
have the Gaussian redshift distribution discussed above, 
normalized to match the number of detections N ga \ in 
each field (Table 1). 

We then change the correlation length tq included in 
£(r), keeping 7 = —1.8, until the numbers of pairs pre- 
dicted by equation 1 matches the observed numbers of 
pairs A^p a i r (Table 1), either field by field or in a combi- 
nation of fields, as listed in Table 1. The uncertainties on 
the derived values of tq are determined by the range of ro 
values that match the Poisson errors on A ga i and -/V pa i r 
in each field. A pair in Table 1 is derived for ro = OMpc. 
Note that the overall result is presented with and with- 
out the SA22 field included, as the overdensity in that 
field was known prior to substantial submillimeter ob- 
servations, although the radio-pinpointed submillimeter 
observations were carried out before any redshifts were 
determined. 

We now illustrate the robustness of this pair-counting 
procedure in an LBG survey field with a more richly sam- 
pled redshift distribution. The largest and best popu- 
lated LBG survey field - the 850" x850" Westphal field 3 
- contains 192 galaxies with redshifts, and 912 galaxy 
pairs using our definition. As the LBG redshift distri- 
bution is approximately a Gaussian with z — 3.0 and 
<r z = 0.28, 594 pairs would be expected with no cluster- 
ing. This corresponds to an excess of pairs over random 
by a factor of 1.55, which requires a correlation length of 
r = (5.8 ± 0.3)/i _1 Mpc using our method of integrating 
over the two-point correlation function. By comparison, 
we obtain a correlation length ro = (6 ± 0.3)/i _1 Mpc 
in this field if we evaluate the two-point ACF against a 
mock catalog of random galaxy positions with the same 
redshift distribution (Landy & Szalay 1993), and then 
compare the results with the ACF obtained from an as- 
sumed three-dimensional correlation function using Lim- 
ber's equation e.g. Wilson 2003). Although this is a 
single relatively small field, and fluctuations in the cor- 
relation amplitude are certainly expected, the result from 

3 |ftp : / /ftp . astro, caltech. edu /pub/ccs /lbgsurvey | 
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Fig. 2. — Positions of radio-pinpointed SMGs in the richest three field in our redshift survey: from left to right these are; the HDF (the 
boundaries of the HDF-N and GOODS fields are also shown), Lockman Hole, and SA13 fields. SMGs in our catalog with spectroscopic 
rcdshifts are represented by open circles. Those in the redshift 'associations' present are shown by different filled symbols. These figures 
demonstrate the typical wide separation of the associations, which span the whole field. 



the pair-counting method used to analyze our small SMG 
sample is in accordance with that expected from more es- 
tablished methods. Note that the strength of clustering 
in this field appears to be greater than for the LBG sam- 
ple as a whole. 

Over all seven fields we find a correlation length ro = 
(6.9 ± 2.1)h~ 1 Mpc for the SMG sample. Only in the 
best-sampled HDF field is a significant result obtained 
in an individual field - r = (9.5 ± 3.3) h~ 1 Mpc - the 
other fields all include too few galaxies and pairs. Note 
that varying the choice of field area or Av makes rela- 
tively little difference to the result. If the size of the HDF 
field is doubled, then the calculated correlation length is 
reduced to r = (7.5 ± 3.2)/i _1 Mpc, consistent with the 
value above. If Av is reduced to 600 km s" 1 , then only 
two pairs of galaxies are lost from the 18 in the sample. 
The calculated correlation length is then increased by 
24%. If the value of Av is increased to 1800 km s" 1 , then 
an additional three pairs are counted, and the correlation 
length is reduced by 8%. Hence, within the uncertain- 
ties the results are robust to changes in the assumptions 
made. 

The correlation length for SA22 is greater, but note 
that SA22 includes a known strong overdensity at the 
redshift of the association of SMGs (Steidel et al. 2000), 
and so it is possible that the value for SA22 does not 
give a fair representation of the clustering of SMGs in a 
typical field. 

Using the larger spectroscopic catalogs of LBGs, we 
also conducted several additional tests of the results. We 
again verified that the predicted and measured counts of 
pairs are not very sensitive to reasonable changes in the 
chosen velocity range Av away from 1200 km s -1 . The 
results depend more strongly on the shape and size of the 
field, with smaller fields yielding a larger boost in the ex- 
pected number of pairs for a certain correlation length, 
as the correlation function £ is larger when sampled on 
smaller scales. We also randomized the redshifts of the 
LBG catalog and, as expected, found that the measured 



excess of pairs vanished. The pair-counting method is 
thus sensitive to real three-dimensional structure in the 
galaxy distribution. We also examined the effects of ran- 
domly dropping a redshift-independent fraction of the 
LBGs from the catalog, to simulate the effects of incom- 
pleteness in sampling a galaxy population with an well- 
known and independent correlation length measurement. 
The results do not change significantly, even if the sample 
becomes 50% incomplete. As we know that the catalogs 
of redshifts for radio-selected SMGs are probably com- 
plete at the 70% level (Chapman et al. 2004a), and that 
no redshifts in the range 1.8 < z < 4 arc preferentially 
excluded from our redshift sample, we can be confident 
in the accuracy of our results. 

A correlation length of ~ 7 ft" 1 Mpc for the SMGs is 
close to that predicted by Shu, Mao & Mo (2001), assum- 
ing that star-formation activity in massive halos follows 
a Schmidt Law with gas surface density; this is a very un- 
certain assumption given the apparent large masses, and 
complex dynamical structure in these galaxies (Frayer et 
al. 1998, 1999; Neri et al. 2003; T. R. Greve et al. 2004b, 
in preparation). We can also compare and contrast the 
correlation length result for the SMGs with the results 
for rarer, more sparsely sampled populations of galaxies. 
The sample of QSOs with a surface density of ~ 30 deg -2 
and z ~ 1.6 ±0.65 from the Two-Degree Field (2dF) sur- 
vey overlaps in redshift with the lower end of the SMG 
redshift distribution. The correlation function of the 2dF 
QSOs is smaller than that for the SMGs, ~ 5 ft," 1 Mpc 
(Croom et al. 2002). The 2dF correlation function is eval- 
uated over a very much larger area (~ 730 deg 2 ), and so 
is not prone to field-to- field variations. The correlation 
length for other samples of high-redshift galaxies, includ- 
ing very deep near-IR selected populations (Daddi et al. 
2003) and bright radio-galaxies (Rottgering et al. 2003) 
have also been evaluated at similar redshifts to our SMG 
samples. For the former, the survey areas are small in 
extent, and have relatively incomplete redshift distribu- 
tions, while for the latter, the survey volumes are sparsely 
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sampled. It is hence difficult to compare the correlation 
functions of these samples directly with SMGs at present. 

We are targeting a statistically reliable subset of our 
SMGs with known redshifts to detect CO emission lines 
using mm-wave interferometry and rest-frame optical 
nebular line emission using near-infrared spectrographs. 
The CO linewidth can be used to provide a dynamically- 
determined estimate of the mass within about lOkpc 
(Frayer et al. 1998; Neri et al. 2003; Genzel et al. 2003; 
T. R. Greve et al. 2004b, in preparation); a more accu- 
rate mass can be obtained if the emission can be resolved 
spatially. Initial indications are that SMGs have velocity 
dispersions that are greater by a factor of 2, and thus 
have dynamical masses greater by a factor of ~ 4, as 
compared with the dynamical masses of LBGs at z ~ 2 
(4 x 1O 1O M ; Erb et al. 2003). The half-light radii of 
SMGs appear to be larger than those of SMGs (Chap- 
man et al. 2003b), which would lead to a further increase 
in the relative mass of SMGs as compared with LBGs by 
a factor of 2. At z ~ 2 near-IR observations of resolved 
nebular line emission are easier, although published sam- 
ples of z ~ 3 LBGs are larger. LBGs appear to be typ- 
ically twice as massive at z — 2 as at z ~ 3. None of 
these direct dynamical measures indicate, however, that 
SMGs are as massive as the several 10 13 M halos their 
clustering properties would suggest (Fig. 2). Their cen- 
tral dynamical masses derived on 10 kpc length scales are 
likely to be a factor of only 3-4 times less than the mass 
of the dark-matter halo, extended to of order 200 kpc, 
given the steep r~ 3 decline in the expected profile of 
dark matter halos outside of the visible galaxy (Navarro, 
Frenk & White 1997). Recent near-IR and CO obser- 
vations of SMGs (Genzel et al. 2003; Tecza et al. 2004; 
T. R. Greve et al. 2004b, in preparation; Swinbank et 
al. 2004) tend to support the greater masses of SMGs as 
compared with LBGs. It is interesting to note that the 
masses of the dark-matter halos surrounding the large, 
well-studied sample of LBGs at z ~ 3 inferred from their 
clustering properties (~ 8 x 1O 11 M ; Adelberger et al. 
1998; Fig. 1) also exceed the central dynamical masses in- 
ferred from spatially structured asymmetric line profiles 
and velocity dispersions, but by a smaller factor than for 
the SMGs. If in fact SMGs can only form in the most 
massive halos, then in a hierarchical picture of galaxy 
formation, this could account for their apparent typical 
moderate redshifts (Chapman et al. 2003a, 2004). 

Differences between the inferred dynamical and dark 
matter masses of high-rcdshift galaxies provides a di- 
rect hint that more complex astrophysics in SMGs could 
break a simple link between clustering amplitude and 
mass (see Shu et al. 2001). For example, the time- 
dependence of merging has been claimed to both boost 
(from infall; Scannapieco & Thackcr 2003), and leave un- 
affected (Percival et al. 2003) the clustering properties of 
galaxies. Intuitively, very luminous SMGs that are only 
briefly visible in overdense regions might be expected to 
have a weaker clustering signal, as compared with a qui- 
escent population that has the same spatial distribution; 
however, the strength of clustering could be increased 
if the luminous SMG phase is triggered preferentially in 
overdense regions. Further observations to investigate 
the relationship between the luminous SMGs and their 
underlying dark matter distribution are necessary to ac- 
count for the effects of time evolution. Note that there 



is also certain to be strong field-to-field variation in the 
density of structures of associated redshifts for SMGs 
(Fig. 2). 

We stress that accurate spectroscopic redshifts are es- 
sential in order to reveal clustering properties of the 
SMGs. The same information cannot be derived from 
two-dimensional data. The expected ACFs for z ~ 1 ex- 
tremely red objects (EROs), z ~ 3 LBG, and SMGs as- 
suming the correlation lengths discussed above are shown 
in Fig. 1. The relatively narrow redshift range for the 
EROs leads to a large expected signal, while the LBGs 
and SMGs have a much weaker ACF due in part to the 
greater distance to both samples, and mainly to their 
broader redshift distributions, especially for the SMGs. 
The inferred correlation length for the SMGs is similar 
to the projected extent of the SMG survey fields, and so 
the relative spatial positions of the members of each asso- 
ciation reveal little information on the maximum extent 
and strength of the clustering present. We await much 
larger square degree-scale fields in order to sample the 
clustering properties of SMGs in a representative region 
of the high-rcdshift universe. 
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Fig. 3. - Comoving correlation length of the SMGs inferred 
from our survey in contrast to other populations of low and high- 
rcdshift galaxies (see the summary in Overzier et al. 2003). The 
horizontal error bar on the SMG point spans the range of redshifts 
over which SMG associations are found (Tablet). The solid line 
shows a representative model for the evolution of a certain over- 
density. The dashed lines show the expected correlation length of 
dark matter halos as a function of mass and redshift. 

4. CONSEQUENCES OF STRONG SMG CLUSTERING 

There are two immediate consequences of the presence 
of a strong clustering signal in the SMG survey. These 
concern both the nature of the SMG themselves, and the 
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galaxies that result when the SMG activity is completed. 
First, since the correlation length of SMGs appears to be 
somewhat larger than for both LBG and QSO surveys at 
comparable redshifts, it seems unlikely that the SMGs 
form a simple evolutionary sequence with either popu- 
lation. It is attractive to link together these different 
samples of high-redshift galaxies, but they are likely to 
be substantially distinct populations. If they are distinct 
then the large luminosities of the SMGs would not repre- 
sent a shorter-lived, more luminous phase during the ac- 
tive lifetime of more typical, less deeply dust-enshrouded 
star forming galaxies, and few powerful active galactic 
nuclei (AGNs) would burn out of the dust cocoons of 
SMGs to appear as optically visible QSOs. Second, if 
there are no complex biases at work, then the correlation 
length inferred for the SMGs is consistent with a form 
of evolution that subsequently matches the large comov- 
ing correlation length typical of evolved EROs at z ~ 1 
and of clusters of galaxies at the present epoch (Fig. 3). 
If this picture is correct, then the descendants of SMGs 
would be rare in the field, and found predominantly in 
rich cluster environments. However, this appears to be 
inconsistent with their space densities, since a progenitor 
of a rich cluster of galaxies is not expected to intersect 
every 100-arcmin 2 field sampled out to redshift z ~ 3. 
The condition for this to occur is a comoving density of 
proto-clusters on the order of 10 _6 Mpc~ 3 , in contrast 
to the observed comoving density of clusters with masses 
M > 8 x 10 14 M Q at z = 0, which is - lCT 7 Mpc- 3 
(Bahcall et al. 2003). However, it is possible that SMGs 
in more extended unvirialized wall and filament large- 
scale structures with masses of several times 10 13 M© at 
z ~ 2.5 (Fig. 3) that subsequently drain, merge and col- 
lapse into more massive clusters could provide a greater 
effective covering factor, to reconcile these values. More 
likely, either the halo masses are less extreme than in- 
ferred from a picture of simple one-to-one halo biasing 
at a given mass, or some other astrophysical biasing ef- 
fect must be at work. By considering the correlations 
between dark matter sub-halos, and a "halo occupancy 
distribution", it is possible to enhance clustering on a 
given halo mass scale (Berlind et al. 2003). Building 
a better observational view of the density of the envi- 
ronments of SMGs based on deep multiwaveband imag- 
ing and densely-sampled spectroscopy should allow these 
ideas to be tested. 

The suggestion of an evolutionary link between SMGs 
and passive EROs from their clustering strength may 
provide useful constraints on the lifetime of the SMG 
phase. The comoving space density of bright radio- 
detected SMGs at z ~ 2.5 is - 2.7 x l{r 5 Mpc -3 . The 
corresponding density of EROs at 0.7 < z < 1.5 is 
(2.2 ± 0.6) x 10~ 4 Mpc~ 3 in a sample with spectroscopic 
completeness of ~ 70% (Cimatti et al. 2002). The den- 
sity of EROs matches closely the comoving abundance 
of L* elliptical galaxies at the present epoch, which is 
~ 2 x 10~ 4 Mpc~ 3 . Hence, because the comoving abun- 
dance of SMGs is much less, even neglecting merging, 



the SMGs and EROs can only be part of a direct evo- 
lutionary sequence with the same density in space if the 
duty cycle of SMGs is short, less than 700 Myr, to gener- 
ate enough evolved, long-lived EROs in the 6 Gyr prior 
to z ~ 1. The resulting EROs could then evolve natu- 
rally into L* elliptical galaxies at the present epoch with 
a very modest amount of merging. The unknown time 
profile of the luminosity of SMGs would certainly also 
have an effect on the results. Direct accurate determina- 
tions of the masses of SMGs are critically important to 
investigate these links. The first hints from systematic 
CO observations are that SMGs are massive enough to 
generate the stellar populations of L* elliptical galaxies 
directly (Neri et al. 2003; T. R. Greve et al. 2004b, in 
preparation) . 

The correlation function of SMGs appears to be con- 
sistent with their being associated with massive dark- 
matter halos at high redshifts, halos more massive than 
those hosting LBGs and QSOs at comparable redshifts. 
The successors of SMGs could thus be associated with 
clusters of galaxies at the present epoch. A key test of 
the nature of the SMGs requires a larger sample of dy- 
namical mass estimates for the population, and a direct 
comparison of their spatial distribution as compared with 
populations of more common optical/UV-sclcctcd galax- 
ies in the same fields. 

5. SUMMARY 

The discovery of associated systems of high-redshift lu- 
minous SMGs, enabled by the measurement of a substan- 
tially complete redshift distribution (z ~ 2-3), hint that 
they represent a strongly clustered population. We de- 
rive a correlation length of (6.9±2.1)/i~ 1 Mpc in our best 
sampled field. Representative samples of high-redshift 
SMGs may thus be found in high-density environments 
that could evolve into rich clusters of galaxies at the 
present epoch. The large correlation length casts doubt 
on a simple evolutionary link with populations of high- 
redshift optical/UV-selected galaxies and QSOs, while 
offering the potential to use associations of SMG redshifts 
to signpost the densest regions of the high-redshift Uni- 
verse with only a modest requirement on spectroscopic 
observing time. A key goal is now to investigate and 
compare the masses of SMGs inferred from clustering 
measurements, and the dynamical masses inferred from 
near-IR and mm-wave CO spectroscopy. This could help 
to determine the nature of the power source and lifetime 
of the SMGs themselves, and reveal the galaxies at the 
present epoch that evolve from the SMGs. 
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TABLE 1 
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Note. — The total number of spectroscopic redshifts JV ga i and the numbers of pairs of galaxies TVpair within 1200 km s 1 in radial 
velocity at redshift z as in each field are listed, along with the pair abundance predicted in the absence of clustering AT' ir . Units of right 
ascension are hours and minutes, and units of declination are degrees and arcminutes. 
a This is a 5-galaxy association 

b These three SMGs lie in the richest spectroscopically confirmed high-redshift large-scale structure (Steidel et al. 2000), which contains 
29 of the 76 LBGs in this field. If the width of the LBG redshift distribution matched that of the SMG surveys, then only 15% of the 
LBGs would lie in this structure, not the 3/9 found in the SMG survey. The comoving density of LBGs in the best-sampled Westphal— 14 hr 
field is 4.6 X 10~ 4 Mpc -3 , while the comoving density of SMGs in all the survey fields is (2.7 ± 0.4) X 10~ 5 Mpc -3 , and in the HDF it is 
2.5 x 10~ 5 Mpc- 3 . 
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